A Partial Digest Approach to Restriction Site Mapping
نویسندگان
چکیده
We present a new, practical algorithm to resolve the experimental data in restriction site analysis, which is a common technique for mapping DNA. Specifically, we assert that multiple digestions with a single restriction enzyme can provide sufficient information to identify the positions of the restriction sites with high probability. The motivation for the new approach comes from combinatorial results on the number of mutually homeometric sets in one dimension, where two sets of n points are homeometric if the multiset of n(n-1)/2 distances they determine are the same. Since experimental data contain errors, we propose algorithms for reconstructing sets from noisy interpoint distances, including the possibility of missing fragments. We analyse the performance of these algorithms under a reasonable probability distribution, establishing a relative error limit of r = theta(1/n2) beyond which our technique becomes infeasible. Through simulations, we establish that our technique is robust enough to reconstruct data with relative errors of up to 7.0% in the measured fragment lengths for typical problems, which appears sufficient for certain biological applications.
منابع مشابه
Modeling of Partial Digest Problem as a Network flows problem
Restriction Site Mapping is one of the interesting tasks in Computational Biology. A DNA strand can be thought of as a string on the letters A, T, C, and G. When a particular restriction enzyme is added to a DNA solution, the DNA is cut at particular restriction sites. The goal of the restriction site mapping is to determine the location of every site for a given enzyme. In partial digest metho...
متن کاملRestriction site mapping is in separation theory
A computer algorithm for restriction-site mapping consists of a generator of partial maps and a consistency checker. This paper examines consistency checking and argues that a method based on separation theory extracts the maximum amount of information from fragment lengths in digest data. It results in the minimum number of false maps being generated.
متن کاملThe Simplified Partial Digest Problem: Hardness and a Probabilistic Analysis
Introduction We study the problem of genome mapping using restriction site analysis. In restriction site analysis, an enzyme cuts a target DNA strand into DNA fragments, and these DNA fragments are used to reconstruct the restriction site locations of the enzyme. Two common approaches are the Double Digest Problem and the Partial Digest Problem. The Double Digest Problem is known to be NP-Compl...
متن کاملComputational Biology Lecture 13: Physical mapping by hybridization
As mentioned before, we have two approaches for physical mapping: Restriction mapping and mapping by hybridization. We covered restriction mapping previously through the two problems of double digest and partial digest. We now look at mapping by hybridization. While restriction mapping involves the mapping of restriction sites (precise short sequences) of a cutting enzyme based on the lengths o...
متن کاملA Partial Digest Approach to Restriction
We present a new, practical algorithm to resolve the experimental data in restriction site analysis, which is a common technique for mapping DNA. Speciically, we assert that multiple digestions with a single restriction enzyme can provide suucient information to identify the positions of the restriction sites with high probability. The motivation for the new approach comes from combinatorial re...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Proceedings. International Conference on Intelligent Systems for Molecular Biology
دوره 1 شماره
صفحات -
تاریخ انتشار 1993